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Circular splicing systems are a formal model of a generative mechanism of circular words, inspired 
by a recombinant behaviour of circular DNA. Some unanswered questions are related to the com- 
putational power of such systems, and finding a characterization of the class of circular languages 
generated by circular splicing systems is still an open problem. In this paper we solve this problem 
for complete systems, which are special finite circular splicing systems. We show that a circular lan- 
guage L is generated by a complete system if and only if the set Lin(L) of all words corresponding 
to L is a pure unitary language generated by a set closed under the conjugacy relation. The class 
of pure unitary languages was introduced by A. Ehrenfeucht, D. Haussler, G. Rozenberg in 1983, 
as a subclass of the class of context-free languages, together with a characterization of regular pure 
unitary languages by means of a decidable property. As a direct consequence, we characterize (reg- 
ular) circular languages generated by complete systems. We can also decide whether the language 
generated by a complete system is regular. Finally, we point out that complete systems have the same 
computational power as finite simple systems, an easy type of circular splicing system defined in the 
literature from the very beginning, when only one rule is allowed. From our results on complete 
systems, it follows that finite simple systems generate a class of context-free languages containing 
non-regular languages, showing the incorrectness of a longstanding result on simple systems. 



1 Introduction 

The notion of linear splicing systems was first introduced in |[T6l , where Head modelled a recombinant 
behaviour of DNA molecules as a particular operation between words in a formal language. Subse- 
quently, circular splicing systems were introduced in IfTTl along with various open problems related to 
their computational power. In the circular context, the splicing operation acts on two circular DNA 
molecules by means of a pair of restriction enzymes as follows. Each of these two enzymes is able to 
recognize a pattern inside one of these two circular DNA molecules and to cut the molecule in the middle 
of such a pattern. Two linear molecules are produced and then they are pasted together by the action of 
ligase enzymes. Thus a new circular DNA sequence is generated |fl6l [T8l 1221 . For instance, circular 
splicing models the integration of a plasmid into the DNA of a host bacteria. A language-theoretic op- 
eration can be defined and, depending on whether or not these ligase enzymes substitute the recognized 
pattern, we have Pixton's definition or Head's and Paun's definition. 
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In the circular context we deal with circular words, i.e., equivalence classes with respect to the 
conjugacy relation ~, defined by xy ~ yx, for x,y £ A* ll20l . Loosely speaking, a circular word is a 
word w written on a circle and all the words that are equivalent to w can be obtained by reading the 
letters on the circle, starting from any point on it. We can then consider sets of circular words or circular 
languages. In short, a circular splicing system is a triple (A,I,R) where A is a finite alphabet, / is the 
initial circular language and R is the set of rules, having the form mi#M2$W3#«4, with w, G A*, 1 < i < 4. 
Splicing rules are iteratively applied starting from /. The corresponding circular splicing language is 
the smallest language which contains / and is invariant under iterated splicing by rules in R. Finding a 
characterization of the class of circular languages generated by circular splicing systems is still an open 
problem. Partial results are known for Pixton (resp. Paun) circular splicing systems S = (A,I,R) with 
/ and R satisfying additional assumptions, namely R is assumed to be a reflexive, symmetric set of rules 
and self-splicing is allowed Ifl8ll23l . 

In this paper we consider classes of splicing systems where the splicing operation is of a simpler 
form and for which finding a relationship with regular circular languages is a quite natural question 
investigated in this research area. Precisely, we consider Paun systems 5 with both / and R finite sets 
(finite Paun systems) and such that a#l$b#l € R, for each a,b £A, where 1 is the empty word and A is 
the alphabet of /. In this case the splicing operation is very easy: it applies to any pair of circular words, 
~xa,~yb, and gives as a result the circular word ~xayb. Therefore, the splicing operation can be seen 
as a concatenation followed by a closure under the conjugacy relation. We have called these systems 
complete. 

The main contribution of this paper is the characterization of the computational power of complete 
systems. Indeed, we show that a circular language L is generated by a complete system if and only if 
Lin{L), the set of all words such that the corresponding circular words are in L, is a pure unitary language 
generated by a set closed under the conjugacy relation. 

The class of pure unitary languages is a subclass of the class of context-free languages, introduced 
in lfl2l . Equivalently, a language is pure unitary if it is obtained by applying the iterated insertion 
operation, starting with a finite set Y lfl5l . Thus we show that circular splicing and iterated insertion are 
closely related. In the same paper lPT2l . the authors also characterized regular pure unitary languages by 
means of a decidable property. As a consequence, we characterize regular circular languages generated 
by complete systems and we can also decide whether a complete system generates a regular circular 
language. 

We also show that complete systems have the same computational power as circular simple systems 
with only one rule, i.e., finite Paun systems with R = {a#l$a#l}, a € A. As a consequence, we charac- 
terize (regular) circular languages generated by circular simple systems with only one rule. The special 
class of simple systems was first considered in 11241 using Head's definition and then in [8] by taking into 
account Paun's definition, as the circular counterpart of the case of the linear splicing investigated in ETl . 
In 124 1. the authors claimed that Head simple systems always generate regular circular languages. In this 
paper we show that there are simple systems generating non-regular context-free circular languages and 
we give new contributions towards the solution of the still open problem of finding a characterization of 
the class of circular languages generated by finite circular simple splicing systems. (As a matter of fact, 
an example of a non-regular circular language generated by a simple system has been also reported in 
||5l , see Example l4~4l 

A still unanswered question is whether the above-mentioned characterization of regularity can be 
generalized to larger classes of circular splicing languages, in particular to the class of simple systems. 

The paper is organized as follows. In Section|2]we gathered basics on (circular) words and languages, 
circular splicing, CSSH-systems and simple systems. We also conclude the section with a short descrip- 
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tion of how state of the art the open problems are. In Section [3] we state some known and new results on 
pure unitary languages. Section [4] is devoted to our main results. All the proofs have been omitted for 
space constraint but they can be found in an extended version of this paper [6 ]. The main results of this 
paper were also communicated at AutoMathA 2009 (Liege, Belgium, 8-12 June 2009). 

2 Basics 

2.1 (Circular) words and languages 

We denote by A* the free monoid over a finite alphabet A and we set A + = A* \ 1, where 1 is the empty 
word. For a word w E A*, \w\ is the length of w and \w\ a is the number of occurrences of a in w, a G A. 
We also set alph(w) = {a G A | \w\ a > 0}. A word x E A* is a. factor of w G A* if u\,U2 G A* exist 
such that w = U\XU2- For a subset Y of A*, \Y\ is the cardinality of Y and alph(Y) = \J ye Yalph{y). We 
denote by Fin (resp. Reg) the class of finite (resp. regular) languages over A. Given w G A*, a circular 
word ~w is the equivalence class of w with respect to the conjugacy relation ~ defined by xy ~ yx, 
for x,y G A* ll20l . The notations |~w|, alph(~w) will be defined as |w|, alph(w), for any 

representative w of ~w. We will often use the notation w for a circular word ~w. Let "A* denote the set 
of all circular words over A, i.e., the quotient of A* with respect to ~. Given L C A*, ~L = {~w | w G L} 
is the circularization of L whereas, given a circular language C Q~A* , every L C A* such that ~L = C 
is a linearization of C. In particular, a linearization of ~w is a linearization of whereas the. full 

linearization Lin(C) of C is defined by Lin{C) = {w G A* | ~w G C}. We will often write ~w instead of 
{~w} and L instead of ~L, for a set of letters L C A Given a family of languages FA in the Chomsky 
hierarchy, FA~ is the set of all those circular languages C which have some linearization in FA. Thus, 
Reg~ = {CC ~A* | 3L G Reg : ~L = C}. If C G then C is a regular circular language. It is 
classically known that given a regular (resp. context-free) language L C A*, Lin(~L) is regular (resp. 
context-free) lfl9l . As a result, given C C ~A*, we have C G (resp. C is a context-free circular 
language) if and only if Lin(C) is regular (resp. context-free) [ 18]. 

2.2 Circular splicing 

In this paper we deal with the definition of the circular splicing operation given in lTT8l . The correspond- 
ing circular splicing systems are named here Paun circular splicing systems since they are the counterpart 
of Paun linear splicing systems in the circular context. 

Paun's definition [11811 . A Paun circular splicing system is a triple S = (A,I,R), where A 
is a finite alphabet, / is the initial circular language, with / C ~A* and R is the set of the rules, with 
R C A*#A*$A*#A* and #,$ A. Then, given a rule r = mi#m 2 $M3#«4 and circular words ~V, ~w, 
we set (~w',~w")\- r ~w if there are linearizations w' of ~w', w" of w of ~w such that w' = U2xu\, 
w" = u/ryu-x, and w = U2XU\Un,yu^. If (~w',~w // )r>~w we say that "vv is generated (or spliced) starting 
with ~w" and by using a rule r. We also say that u\U2, U3U4 are sites of splicing. 

We recall that in the original definition of circular splicing given in |[T8l . rules in R could be used 
in two different ways. One way has been described above, while the other, known as self-splicing, will 
not be considered here. From now on, "splicing system" will be synonymous with "circular splicing 
system". 

Given a splicing system S and a circular language C C "A*, we set cr'(C) = {w G ~A* | 3w',w" G 
C, 3r G R : (w f , w")r~ r w}. We also define a°(C) = C, a m (C) = a'(C) U a'(a'(C)), / > 0, and o*(C) = 



Paola Bonizzoni & Clelia De Felice & Rosalba Zizza 



25 



|J J>0 o'(C). Then, L(S) = <J*(I) is the circular language generated by S. A circular language C is .Pawn 
generated (or C is a circular splicing language) if a splicing system 5 exists such that C = L{S). In the 
sequel C(Fin,Fin) denotes the class of the circular languages generated by finite Paun splicing systems, 
where S = (A,I,R) is a finite splicing system if / and R are both finite sets. 

As observed in |fT8ll , we may assume that the set R of the rules in a splicing system S = (A,I,R) 
satisfies additional conditions, having also a biological counterpart. Namely, we may assume that R is 
reflexive (i.e., for each U[#U2$u^#U4 G R, we have u\#U2$u\#U2, u^#U4$u^#U4 G R) or R is symmetric 
(i.e., for each u\#U2$U3#U4 G R, we have ut,#u$u\#U2 G R). We do not assume that R is reflexive. 
On the contrary, we notice that, in view of the definition of circular splicing, if (w',w")\- r w, with 
r = u\#U2$U3#U4, then (w" ,w')\- r < w, with r 1 = u^#U4$ui#U2- Consequently, L(S) = L(S'), where S' = 
(A,I,R') and R' = RU {m3#K4$mi#«2 I wi#W2$"3#"4 G R}- Hence, in order to find a characterization of 
the circular splicing languages, there is no loss of generality in assuming that R is symmetric. Thus, in 
what follows, we assume that R is symmetric. However, for simplicity, in the examples of Paun systems, 
only one of either mi#M2$W3#«4 or ui,#U4$ui#U2 will be reported in the set of rules. 

As already said, in this paper we consider some special classes of finite circular splicing systems. In 
detail, a Paun circular semi-simple splicing system (CSSH system for short) is a finite splicing system 
S = (A,I,R) such that, for each mi#M2$«3#«4 G R, we have \u\U2\ = |«3«4| = 1 0. CSSH systems 
have been considered in (7J[8l|24l, once again as the circular counterpart of linear semi-simple splicing 
systems introduced in lfl4l . If u \U2 = U3U4 G A then S is a simple system [8]. Thus, there are four 
types of rules, namely ai#l$aj#l, ai#l$l#aj, l#a,$a y '#l and l#a,$l#ay, with a,,aj G A. Furthermore, 
since R is symmetric, if a,#l$l#a ; - G R then we also have l#aj$aj#l G R. So, using the terminology of 
EUl, a (l,3)-CSSH system (resp. (1, 3) -circular simple system) is a CSSH system (resp. circular simple 
system) where each rule has the form a,-#l$ay#l, with aj,aj G A. A (2,3)-CSSH system (resp. (2,3)- 
circular simple system) is a CSSH system (resp. circular simple system) where each rule has the form 
l#a,$ay#l, with a/, a,- G A. Finally, a (2,4)-CSSH system (resp. (2, 4) -circular simple system) is a CSSH 
system (resp. circular simple system) where each rule has the form WatSWaj, with at,aj G A. Notice 
that in a (1,3)-CSSH system, circular splicing can be rephrased as follows: given a rule aj#l$aj#l and 
two circular words ~xau ~yaj, the circular splicing yields as a result ~xatyaj. In what follows and in 
order to abbreviate, we will write to denote a rule a,#l$a ; #l in a (1,3)-CSSH system. We will 

focus on a special subclass of these systems, defined in Section|4]and named complete systems. 

2.3 State of the art 

In this section we will give a brief description of how state of the art the open problems on circular 
splicing are. The main result concerning the computational power of circular splicing systems states 
that if S = (A,I,R) is a Paun circular splicing system such that / G FA~, where FA is a full abstract 
family of languages which is closed under cyclic closure (i.e., if L G FA then Lin(~L) G FA), R is a 
finite, reflexive and symmetric set of rules and self-splicing is used, then L(S) G FA~ [18 ]. Other results 
have been proved for finite circular splicing systems Gl[3]|4]]. In particular, it is known that C(Fin,Fin) 
contains regular circular languages (see Q), context-free circular languages which are not regular (see 
EEH), context-sensitive circular languages which are not context-free (see 1 13 ]) and there exist regular 
circular languages which are not in C(Fin,Fin) (see [3 ]). However, the problem of characterizing circular 
languages in Reg~ r\C(Fin,Fin) remains open. We do not even know if, given L in C(Fin,Fin), it is 
decidable whether L is regular. A characterization of C(Fin,Fin) (and of Reg" C\C{Fin,Fin)) has been 
obtained for languages over a one-letter alphabet in (3HU. 

Concerning CSSH systems, in J8), the authors compared the classes of circular languages generated 
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by (/,7')-circular simple systems, for different values of E {(1,3), (2,4), (2,3)}, i.e., for differ- 
ent positions of the letter in the rule. They proved that the class of circular languages generated by 
(1,3) -circular simple systems (resp. (2,3)-circular simple systems) is equal to the class of languages 
generated by (2,4)-circular simple systems (resp. (1,4) -circular simple systems), whereas the class of 
circular languages generated by (1,3) and (2,3)-circular simple systems are not comparable. An anal- 
ogous viewpoint was adopted for Paun circular semi-simple splicing systems in [7] where the authors 
highlighted further differences between circular simple and CSSH systems. Indeed, they proved that the 
class of circular languages generated by (1,3)-CSSH systems and the class of circular languages gen- 
erated by (2,4)-CSSH systems are not comparable. Finally, in [10] the authors proved that the class of 
circular languages generated by (2,4)-CSSH systems is the class of the reversal of the circular languages 
generated by (1,3)-CSSH systems. Loosely speaking, the reversal of a (circular) word is the (circular) 
word written backwards and the reversal of a (circular) language L is the (circular) language consisting 
of the reversals of all its (circular) words. 

In lfl3 1. the author claimed that the class of circular languages generated by CSSH systems is con- 
tained in the class of context-free circular languages. Marked systems, i.e., CSSH systems satisfying 
additional hypotheses, were introduced in (9l [10| with a characterization of the corresponding regular 
circular languages generated. This characterization was reviewed in a graph theoretical setting in Q. 
However, a still open problem is to find a characterization of the class of regular circular languages 
generated by CSSH systems. 

Remark 2.1 As stated in the introduction, the notion of simple splicing systems has been originally 
considered using the Head splicing operation. A Head circular splicing system Sh = (A, I, T,P) is defined 
by giving a finite alphabet A, the initial set / C ~A*, the set T of triples, T C A* x A* x A*, and where P 
is a binary relation on T such that, for each (p,x,q), (u,y,v) S T, (p,x,q)P(u,y,v) if and only if x = y. 
Head defined circular splicing as an operation on two circular words ~hpxq, ~kuxv G "A* performed 
by two triples (p,x,q),(u,x,v) and producing ^hpxvkuxq. The word x is called a crossing of the triple. 
In 0, the authors proved that there is a canonical transformation of a Head system Sh = (A,I,T,P) 
into a Paun system S = (A,I,R) such that S and Sh generate the same language. This transformation 
is defined by R = {px#q$ux#v j (p,x,q),(u,x,v) € T} (Proposition 3.1 in [3]). We notice that this 
transformation defines a bijection between the class of the circular languages generated by (l,3)-circular 
simple systems S = (A,I,R), with R = {(a,a)} and that of circular languages generated by Head splicing 
systems Sh = (A,I,T,P) with T = {(l,a, 1)}, (I, a, l)P(l,a,l). On the other hand, in |[24l . the authors 
discussed Head systems Sh = (A,I,T,P) and the corresponding splicing operation was named action 
SA1 of Sh- Furthermore, a triple in T of the form (I, a, 1), a G A, was said to have null context and 
crossing of length one. The first part of Theorem 3.4 in ll24l claims the regularity of the circular splicing 
language under action SA1 of a splicing system Sh = (A,I,T,P) with / finite and triples in T having 
null context and crossings of length one. On the contrary, Example 14.41 shows that (1,3) -circular simple 
systems S exist with L(S) being a non-regular language, so disproving the above-mentioned Theorem 3.4 
in EH. 

3 Pure unitary languages 

In this section, we deal with a class of languages, named pure unitary languages, already considered 
in |[TT1 [T2l [T5l with the aim of finding conditions under which a context-free grammar will generate 
a regular language. Pure unitary languages (named insertion languages in ifTTTO are a simple class of 
"generalized Dyck languages" and they can be defined in several ways. Here we follow the viewpoint 
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adopted in |fl31 , where these languages are defined by means of the operation of iterated insertion. We 
report some of their known properties and we also give a condition under which a pure unitary language 
is closed under the conjugacy relation (Lemma [3.31) . The operations of insertion and iterated insertion 
are variants of classical operations on formal languages. We recall their definitions below. 

Definition 3.1 H15\l Given Z,Y C A*, the operation of insertion, denoted by <—, is defined by Z <—Y = 
{ziyzi | Z1Z2 € Z andy E Y}. The operation of iterated insertion, denoted by <— *, is defined inductively 
from the operation of insertion by Y^° = {1}, Y^ i+1 = F* - ' <— Y and Y^* = U,>oJ^'. 

Definition 3.2 KT2\ [731/ A language L is a pure unitary language ifL = Y^* with Y being a finite set. 

Since Y^* = (Y \ l)* - *, in what follows, we assume Y C A + . In [12J the authors stated a charac- 
terization of regular pure unitary languages by means of a decidable property. This result is partially 
reported below. 

Theorem 3.1 [12] Let Y be a finite set such that alph(Y) = A. Then L = Y^* is regular if and only if 
Y is subword unavoidable in A*, i.e., there exists a positive integer k such that any word u € A*, with 
\u\ >k, contains as a factor a word ofY. For any regular set R C A*, it is decidable whether or not R is 
subword unavoidable in A*. 

The construction of a grammar generating Y^* is folklore and is reported below. As usual, here 
and from now on we denote by L(G) the language generated by a context-free grammar G. Let w = 
ajj • • -at h € Y, where {a,-, , . . . is a multiset of elements in A. Then, we define the production p w = 
X -> Xa h Xa h ■ ■ Xa ih X. We set G Y = ({X},A,P,X), P = {X -> 1} U {p w \ w G Y} and we shall call 
Gy the pure unitary grammar associated with Y. Our main result follows from the relation Y^* = 
L(Gy) (Proposition 13. II folklore) and from two properties of the grammar Gy, namely the closure of the 
language L{Gy) under concatenation (Lemma [3.21 ) and under the conjugacy relation (Lemma [3.31) . the 
latter property being satisfied under the condition that Y keeps the same property. Lemma [37TI is needed 
in the proof of Lemma l3~3l and the three lemmas show a not yet thoroughly investigated relation between 
insertion, concatenation and closure under the conjugacy relation. 

Lemma 3.1 For each w,z € L(Gy),for each wi,W2 £ A* such that w = W\W%, we have W\ZW2 € L(Gy). 

Proposition 3.1 Given a finite set Y C A*, we have Y^* = L{Gy). 

The following two lemmas are needed in the next section. 

Lemma 3.2 Given a finite set Y C A*, the language Y^* is closed under concatenation, i.e., ifwi,W2 € 
Y^* then w\W2 G Y^*. 

Lemma 3.3 Let Y be a finite set. IfY is closed under the conjugacy relation then L{Gy) is closed under 
the conjugacy relation. 

4 Main Result 

In this section we will state the main results of the paper. Precisely, in Section 14.11 we introduce the 
notion of a complete system S and we state that L is generated by 5 if and only if Lin{L) may be obtained 
by iterated insertion starting with a language closed under the conjugacy relation (Theorem 14- 1 b - A 
regularity characterization of splicing languages generated by complete systems follows by the above 
result (Corollary I4.2I ). Then, in Section l4~2l we discuss the case of (1,3) -circular simple systems with 
only one rule {a, a). We show that these systems have the same computational power as complete systems 
in Proposition 14. 21 Consequently we characterize the corresponding generated languages. 
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4.1 Complete circular splicing systems 

Definition 4.1 A complete system S = (A,I,R) is a (1, 3)-CSSH system such that R = A x A, alph(I) = A 
and 107. 

Example 4.1 Let 5= (A,I,R), where A = {a,b}, I = ~{ab} and/?= {(a,a),(b,b),(a,b)}. Therefore, S 
is a complete system. By using Theorem 14. II we will show that L(S) is a non-regular circular language. 
On the contrary, let S = (A,I',R), where /' = ~{ab,aa,bb}. As we will see below, S is a complete system 
generating a regular circular language. 

Remark 4.1 Let S = (A,I,R) be a complete system. Notice that 1 L(S) since, as shown in Q, for any 
Paun circular splicing system S = (A,I,R), we have that 1 € I if and only if 1 € L(S). 

One of our main results, stated in Theorem 14- 1 1 is based on the following observation. On the one 
hand we have already pointed out that if L = Y^* is obtained by iterated insertion, starting with a finite 
language Y closed under the conjugacy relation, then L is closed under concatenation and under the 
conjugacy relation (Lemmas 13-21 13.31) . On the other hand, by definition the full linearization Lin(L(S)) 
of a circular splicing language L(S) is a language closed under the conjugacy relation. In addition, if S is 
a complete system then Lin(L(S)) is also closed under concatenation (Lemma |4~TT ). 

Lemma 4.1 Let S be a complete system. Then Lin(L(S)) is closed under concatenation, i.e., ifw,w' € 
Lin(L(S)) then ww' E Lin[L[S)). Analogously, if~w,~W € L(S) then £ L(S). 

The above result is strengthened by Proposition 14. 1 1 Indeed we state that Lin(L(S)) is the smallest 
language closed under conjugacy relation, under concatenation and containing I. 

Proposition 4.1 Let S = (A,I,R) be a complete system. Let L be a language such that L is closed under 
the concatenation, L is closed under the conjugacy relation and Lin(I) C L. Then Lin(L(S)) C L. 

Since the language L = (Lin(I))*~* , obtained by iterated insertion starting with Lin{I), satisfies the 
hypotheses in Proposition 14. 1[ we obtain Corollary 14. li as a direct result. 

Corollary 4.1 Let S = (A,I,R) be a complete system. Then Lin{L{S)) C (Lin(I))*~* \ 1. 

By the previous preliminary results we can thus prove Theorem 14. 1 1 stating the connection between 
pure unitary languages, pure unitary grammars and circular splicing languages. 

Theorem 4.1 The following conditions are equivalent: 

(1) There exists a complete system S = (A,I,R) such that L = L(5). 

(2) There exists a finite language Y, with alph(Y) = A, such that Y is closed under the conjugacy 
relation and Lin(L) = L(Gy) \ 1, i.e., L = ~(L(Gy) \ 1). 

(3) There exists a finite language Y, with alph(Y) = A, such that Y is closed under the conjugacy 
relation and Lin(L) = Y^* \ 1, i.e., L = ~(Y^* \ 1). 

Example 4.2 Let S = (A,I,R) be the complete system reported in Example 14.11 i.e., A = {a,b}, I = 
~{ab} and/? = {(a, a), (b,b), (a,b)}. By using Theorem l4.ll we have Lin(L(S)) = {w € {a.b} + \ \w\ a = 
\w\b}- Thus Lin{L{S)) is a non-regular language and consequently L(S) is a non-regular circular lan- 
guage. 

A fundamental consequence of Theorems 13.11 and | 4. 1 1 is stated in Corollary 14.21 
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Corollary 4.2 Let S = (A,I,R) be a complete system. Then, L(S) is a regular circular language if and 
only ifLin(I) is subword unavoidable. Consequently, it is decidable whether L(S) is a regular circular 
language. 

Example 4.3 Let S = (A,I',R) be the complete system reported in Example 14.11 i.e., A = {a,b}, I' = 
~{ab,aa,bb} and R = {(a,a),(b,b),(a,b)}. Of course Lin{l') is subword unavoidable. Thus, in view of 
Corollary 14.21 L(S) is aregular circular language. Actually, Lin(L(S)) = ({a,b} 2 ) + = {w G {a,b}* \ 3k > 
0: \w\ =2k}. 

Remark 4.2 Let S be a complete system. Thus L = Lin(L(S)) = Lin(L(S)) + (Lemma |4~TI ) and LU {1} is 
a star language, i.e., LU {1} is a language which is closed under the conjugacy relation and which is the 
Kleene closure of a language [3 ]. In [3 ], the authors defined a class of regular star languages L by means 
of a property of the finite state automaton recognizing L and such that ~L G C(Fin,Fin). Finding the 
relation between this class of star languages and pure unitary languages is a still unexplored direction. 

4.2 Simple systems 

In this section we show how our characterization of the computational power of complete systems, can 
be extended to a special type of (l,3)-circular simple systems. In detail, a reformulation of Theorem 
14.11 and Corollary 14.21 still hold in terms of (1,3) -circular simple systems having only one rule. This 
reformulation is obtained by using a bijection between these systems and complete systems (Proposition 
I4.2I ). It is clear that, in order to extend this bijection to (1,3) -circular simple systems with a set of rules 
of larger size, we need a generalization of the notion of complete systems. 

Let us briefly give an intuitive description of the results stated in this section. As already said, the 
circular splicing language is obtained by iterated applications of the splicing operation, starting with all 
pairs of circular words in/. Let S = (A,I,R) be a (1,3) -circular simple system where R = {(a, a)}, a £A. 
In this context, given two circular words ~ha, ~ka, the circular splicing yields as a result ~haka. In other 
words the splicing operation is allowed on every position where a appears. Therefore we handle all words 
w in Lin(I) (and in Lin(a l (I))) having the form w = x\a ■ ■ -x^a, where xj G (A \a)*. It is easily seen that 
each of these words w is in the free monoid generated by a finite prefix code F = {x\a, . . . ,x^a, . . .}. 
Therefore, by a coding morphism ctj — > Xja, F is identified with a finite alphabet A' and the rule (a, a) 
is identified with the set of rules A' x A'. Since regular (resp. context-free) languages are closed under 
morphism, we can define a bijection between complete systems S' and (1,3) -circular simple systems S 
with one rule such that L(S') is regular (resp. context-free) if and only if L(S) is regular (resp. context- 
free). Let us state the definitions and results in a precise way. 

Given / = ~{wi, ...,Wk\ 1 < i < k}, the set F C A + associated with / is the set of smallest cardinality 
which satisfies the following conditions: (1) for each word w in F we have w G A* a and \w\ a = 1; (2) for 
each ~w G /, if w' is any linearization of ~w such that w' G A*a then w' G F + . 

Hence F = {xta \ 1 < i < n} is a finite prefix code (i.e., F HFA + = 0) and for each ~w; G /, we 
have ~W; = ^x^ia- - -Xi^a, where j,- G N, \xi jg \ a = 0, for each 1 < g < ji and x^ia, . . . ,Xu t a is a set of 
not necessarily different elements in F. Here, we assume that a G alph{w), for each w G /. Notice that, 
in order to characterize the circular splicing language generated by S, there is no loss of generality in 
making this assumption. Indeed, by using a result stated in iflOl . we have that L(S) = L(S\) U 
where Si = (A,Ii,R) and I\ = {w G / | a G alph(w)}. From now on, S = (A,I,R) will denote a (1,3)- 
circular simple system with R = {(a, a)}, a G A and / = ^{x^ia ■ ■ -x^a \ 1 < i < k}, where \x^ g \ a = 0, 
for each 1 < g < jj. Of course L(S) C ~F + . We now state that the above systems have the same 
computational power as special complete systems. 
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Let (p be a coding moronism for F, i.e., a moronism (p:A'* — > A* which is injective and such that 
F = (p(A') Thus, \A'\ = \F\. Set A' = {(Xi,. . . ,GC„}. We know that <p is defined by <p(a,) = x,a, 
1 < i < n. We also know that (p defines a map from ~A'* into "A* if we set <p(~w) = ~<p(w), for all 
w G A'*, i.e., <p(~w) does not depend on which representative in ~w we choose to define it by lfT8l . Thus, 
if w,w' G A'* and w ~ w' then <p(w) ~ <p(w'). In what follows S 1 = (A', /',/?') will denote the complete 
system defined by /' = (p~ l (I). Thus, A' = alph(I') and = A' x A'. 

Proposition 4.2 We have (p(L(S')) = L(S) and (p^(L(S)) = L(S'). Consequently, L(S) is a regular 
(resp. context-free) circular language if and only ifL(S') is regular (resp. context-free). 

In view of Proposition l4.2l Theorem l4.1l and Corollary I4.2l may be rephrased for (1 , 3)-circular simple 
systems S with only one rule (see Il6l). 

Example 4.4 We recall that a word x G A + is called unbordered if x G uA + n A + u implies u = 1 ifTTl. 
Given w,x G A*, with a" being an unbordered word, we denote by \w\ x the number of occurrences of x 
in w. Let S = (A,I,R) be the (1,3) -circular simple system defined by A = {a,b,c}, I = ~{baca} and 
R = {(a, a)}. In view of Proposition 14. 21 we have L(S) = (p(L(S')), where 5" = (A',I',R ! ) is the complete 
system defined by /' = ~{aj3}, A' = alph(I') = {a, j8} and R' = A' x A'. By using Theorem 14.11 we 
have Lin{L{S')) = {w G {a,j8} + | \w\ a = \w\p} and so L(S') = ~{w G {a,j8} + | \w\ a = \w\p} is a 
(non-regular) context-free circular language (see Example 14. 21 ). By using Proposition 14.21 once again, 
L(S) = (p(L(S')) = ~{w G {a,b,c} + \ \w\ba = \w\ ca \ is a (non-regular) context-free circular language. 
An ad hoc proof of the non-regularity of the circular language L(S) has also been reported in ll5ll . 
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